Routing with Locality on Meshes with Buses
Authors
Abstract
... a d + O(d/f(d))-step, O(f(d))-buffer-size routing algorithm, which is asymptotically optimal if f(d) is chosen to be a large constant.
In our study, we assume that all processors operate in synchronous MIMD mode. At any time step, each processor can communicate with all of its grid neighbors and can both send and receive one packet along each mesh link. Processors can also store packets in their own queues. This model (hereafter referred to as the base model) is the same as the ones used in [9, 10, 12–16, 22, 25].
The main disadvantage of the mesh topology is its large diameter, which has a direct impact on the communication times of many parallel algorithms. Augmenting arrays of processors with various faster mechanisms has been suggested as a means of speeding up communication among the processors. Examples are meshes with multiple buses, which have a bus in each column and each row [1, 20]; generalized meshes with multiple buses, which are composed of smaller meshes with multiple buses [8]; meshes with separable row and column buses, in which row and column buses can be separated into multiple shorter buses by turning bus switches on or off [24]; and reconfigurable meshes, in which links can be connected to form buses [2]. These enhanced meshes can solve problems that require only a limited amount of global communication significantly faster. This paper shows how to utilize these buses in some "high-bandwidth" routing problems.
In all the related bused mesh models, except reconfigurable meshes, broadcast buses are added to the base model. Each broadcast bus is connected to a set of processors. In each time step, only one processor attached to a bus can send a packet via the bus. In addition, a processor can receive packets from all the buses attached to it in a time step.
A number of proposed bused mesh models assume the propagation delay of a bus to be a constant that is independent of the number of processors attached to it. This assumption is thought to be a reasonable one in practical situations [1, 2, 3, 20, 27]. However, Lu et al. [18] investigated physical implementations of buses and found that short buses and long buses do differ in performance, and that the constant-delay assumption is more appropriate for short-bus models. In this paper, we assume the propagation delay of the buses to be one time step, as is also assumed in [1, 8, 17, 27]. As we will see, ...
Journal of Parallel and Distributed Computing 33, 84–90 (1996), Article No. 0027
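The communication rules described in the abstract (one packet per mesh link per step, one sender per bus per step, unit bus delay) can be illustrated with a minimal Python sketch. It assumes a two-dimensional mesh with one bus per row and one per column, which is only one of the models listed above. The names `Packet`, `mesh_only_steps`, `bus_only_steps`, and `schedule_on_bus` are hypothetical, and the lowest-index-first arbitration rule is an assumption of this sketch, not something the source specifies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    src: tuple[int, int]  # (row, col) of the sending processor
    dst: tuple[int, int]  # (row, col) of the destination processor


def mesh_only_steps(p: Packet) -> int:
    """Steps for one uncontended packet over mesh links: the Manhattan distance."""
    return abs(p.src[0] - p.dst[0]) + abs(p.src[1] - p.dst[1])


def bus_only_steps(p: Packet) -> int:
    """Steps for one uncontended packet over row/column buses with unit delay:
    one bus transfer for each coordinate that differs."""
    return int(p.src[0] != p.dst[0]) + int(p.src[1] != p.dst[1])


def schedule_on_bus(senders: list[int]) -> dict[int, int]:
    """Serialize senders on a single bus, one sender per time step.
    Lowest processor index first is an assumed arbitration rule."""
    return {proc: step for step, proc in enumerate(sorted(senders), start=1)}


if __name__ == "__main__":
    p = Packet(src=(0, 0), dst=(7, 7))
    print(mesh_only_steps(p))          # 14
    print(bus_only_steps(p))           # 2
    print(schedule_on_bus([5, 2, 9]))  # {2: 1, 5: 2, 9: 3}
```

A single far-travelling packet benefits greatly from the buses, but k processors contending for the same bus need k steps under the one-sender rule. This suggests why high-bandwidth routing problems cannot rely on the buses alone.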
Similar resources
Routing with locality in partitioned-bus meshes
We show that adding partitioned buses (as opposed to long buses that span an entire row or column) to ordinary meshes can reduce the routing time by approximately one-third for permutation routing with locality. A matching time lower bound is also proved. The result can be generalized to multi-packet routing.
Routing and Sorting on Meshes with Row and Column Buses
We give improved deterministic algorithms for permutation routing and sorting on meshes with row and column buses. Among our results, we obtain a fairly simple algorithm for permutation routing on two-dimensional meshes with buses that achieves a running time of n + o(n) and a queue size of 2. We also describe an algorithm for routing on r-dimensional networks with a running time of (2 - 1/r)n + ...
Permutation Routing and Sorting on Meshes with Row and Column Buses
We study the problems of permutation routing and sorting on several models of meshes with fixed and reconfigurable row and column buses. We describe two fast and fairly simple deterministic algorithms for permutation routing on two-dimensional networks, and a more complicated algorithm for multi-dimensional networks. The algorithms are obtained by converting two known off-line routing schemes into ...
Randomized Routing on Meshes with Buses
We give algorithms and lower bounds for the problem of routing k-permutations on d-dimensional MIMD meshes with additional buses. A straightforward argument shows that for all d ≥ 1, (2/3)n steps are required for routing permutations (the case k = 1) on a d-dimensional mesh. We prove that routing permutations on d-dimensional meshes requires at least (1 - 1/d)n steps. For small d better lower bou...
BAT: Performance-Driven Crosstalk Mitigation Based on Bus-Grouping Asynchronous Transmission
Crosstalk delay within an on-chip bus can induce a severe transmission performance penalty. A Bus-grouping Asynchronous Transmission (BAT) scheme is proposed to mitigate the performance degradation. Furthermore, considering the distinct spatial locality of transition distribution on some types of buses, we use the locality to optimize the BAT. In terms of the implementation, we propose the Diffe...
Journal title:
- J. Parallel Distrib. Comput.
Volume 33, Issue
Pages 84–90
Publication date: 1996